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SEQUENCE PROTOCOL 



(l)NGENERAL INFORMATION: 

(K APPLICANT: 

\ (A) NAME: Boehringer Mannheim GmbH 
\b) ROAD: Sandhoferstr . 116 
(O0 CITY: Mannheim 

(E) \COUNTRY: DE 

(F) POSTAL CODE: 68305 

(G) TELEPHONE: 06217595482 

(H) TEL/FAX: 06217594457 

(ii) TITLE OF INVENTION: Polymerase chimeras 
(iii) NUMBER OF SEQUENCES: 14 

(iv) COMPUTER READABLE FORM: 

(A) DATA CARRIER^ Floppy disk 

(B) COMPUTER: IBmVc compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1-0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: \ 

(i) SEQUENCE CHARACTERISTICS \ 

(A) LENGTH: 2733 base pairs\ 

(B) TYPE: nucleotide \ 

(C) STRANDEDNESS: single stra\d 

(D) TOPOLOGY: linear \ 

(ii) TYPE OF MOLECULE: genomic DNA \ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:\ 

ATGAGGGGCT CGCATCACCA TCACCATCAC GCTGCTGACG ATQACGATAA AATGAGGGGC 60 

ATGCTACCGC TATTTGAGCC CAAGGGCCGG GTCCTCCTGG TCGAOGGCCA CCACCTGGCC 120 

TACCGCACCT TCCACGCCCT GAAGGGCCTC ACCACCAGCC GGGGGGAGCC GGTGCAGGCG 180 

GTCTACGGCT TCGCCAAGAG CCTCCTCAAG GCCCTCAAGG AGGACGGGGA CGCGGTGATC 240 

GTGGTCTTTG ACGCCAAGGC CCCCTCCTTC CGCCACGAGG CCTACGGGGgVjTACAAGGCG 300 

GGCCGGGCCC CCACGCCGGA GGACTTTCCC CGGCAACTCG CCCTCATCAA *G^AGCTGGTG 360 

GACCTCCTGG GGCTGGCGCG CCTCGAGGTC CCGGGCTACG AGGCGGACGA CGTCCCTGGCC 420 

AGCCTGGCCA AGAAGGCGGA AAAGGAGGGC TACGAGGTCC GCATCCTCAC CGCCOACAAA 480 

GACCTTTACC AGCTCCTTTC CGACCGCATC CACGTCCTCC ACCCCGAGGG GTACCTOATC 54 0 

ACCCCGGCCT GGCTTTGGGA AAAGTACGGC CTGAGGCCCG ACCAGTGGGC CGACTACC^G 600 

GCCCTGACCG GGGACGAGTC CGACAACCTT CCCGGGGTCA AGGGCATCGG GGAGAAGACCA 660 



GCGAGGAAGC TTCTGGAGGA GTGGGGGAGC CTGGAAGCCC TCCTCAAGAA CCTGGACCGG 



<J20 
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CTGAAGCCCG CCA^^GGGA GAAGATCCTG GCCCACATGG^^ATCTGAA GCTCTCCTGG 78 0 
GACCTGGCCA AGGTGCGCAC CGACCTGCCC CTGGAGGTGG ACTTCGCCAA AAGGCGGGAG 84 0 
CCCGACCGGG AGAGGCTTAG GGCCTTTCTG GAGAGGCTTG AGTTTGGCAG CCTCCTCCAC 900 
GAGTTCGGCC TTCTGGAAAG CCCCTATGAC AACTACGTCA CCATCCTTGA TGAAGAAACA 960 
CTGAAAGCGT GGATTGCGAA GCTGGAAAAA GCGCCGGTAT TTGCATTTGA TACCGAAACC 1020 
GACAGCCTTG ATAACATCTC TGCTAACCTG GTCGGGCTTT CTTTTGCTAT CGAGCCAGGC 1080 
GTAGCGGCAT ATATTCCGGT TGCTCATGAT TATCTTGATG CGCCCGATCA AATCTCTCGC 114 0 
GAGCGTGCAC TCGAGTTGCT AAAACCGCTG CTGGAAGATG AAAAGGCGCT GAAGGTCGGG 1200 
CAAAACCTGA AATACGATCG CGGTATTCTG GCGAACTACG GCATTGAACT GCGTGGGATT 1260 
GCGTTTGATA CCATGCTGGA GTCCTACATT CTCAATAGCG TTGCCGGGCG TCACGATATG 132 0 
GACAGCCTCG CGGAACG.TTG GTTGAAGCAC AAAACCATCA CTTTTGAAGA GATTGCTGGT 1380 

* 

AAAGGCAAAA ATCAACTGAC CTTTAACCAG ATTGCCCTCG AAGAAGCCGG ACGTTACGCC 1440 
GCCGAAGATG CAGATGTCAC CTTGCAGTTG CATCTGAAAA TGTGGCCGGA TCTGCAAAAA 1500 
CACGAGAGGC TCCTTTGGCT TTACCGGGAG GTGGAGAGGC CCCTTTCCGC TGTCCTGGCC 1560 
CACATGGAGG CCACGGGGGT GCGCCTGGAC GTGGCCTATC TCAGGGCCTT GTCCCTGGAG 1620 
GTGGCCGAGG AGGTCGCCCG CCTCGAGGCC GAGGTCTTCC GCCTGGCCGG CCACCCCTTC 1680 
AACCTCAACT CCCGGGACCA GCTGGAAAGG GTCCTCTTTG ACGAGCTAGG GCTTCCCGCC 1740 
ATCGGCAAGA CGGAGAAGAC CGGCAAGCGC TCCACCAGCG CCGCCGTCCT GGAGGCCCTC 1800 
CGCGAGGCCC ACCCCATCGT GGAGAAGATC CTGCAGTACC GGGAGCTCAC CAAGCTGAAG 1860 
AGCACCTACA TTGACCCCTT GCCGGACCTC ATCCACCCCA GGACGGGCCG CCTCCACACC 192 0 
CGCTTCAACC AGACGGCCAC GGCCACGGGC AGGCTAAGTA GCTCCGATCC CAACCTCCAG 198 0 
AACATCCCCG TCCGCACCCC GCTTGGGCAG AGGATCCGCC GGGCCTTCAT CGCCGAGGAG 204 0 
GGGTGGCTAT TGGTGGCCCT GGACTATAGC CAGATAGAGC TCAGGGTGCT GGCCCACCTC 2100 
TCCGGCGACG AGAACCTGAT CCGGGTCTTC CAGGAGGGGC GGGACATCCA CACGGAGACC 2160 
GCCAGCTGGA TGTTCGGCGT CCCCCGGGAG GCCGTGGACC CCCTGATGCG CCGGGCGGCC 2220 
AAGACCATCA ACTTCGGGGT CCTCTACGGC ATGTCGGCCC ACCGCCTCTC CCAGGAGCTA 2280 
GCCATCCCTT ACGAGGAGGC CCAGGCCTTC ATTGAGCGCT ACTTTCAGAG CTTCCCCAAG 2340 
GTGCGGGCCT GGATT GAGAA GACCCTGGAG GAGGGCAGGA GGCGGGGGTA CGl-GGAGACC 2400 
CTCTTCGGCC GCCGCCGCTA CGTGCCAGAC CTAGAGGCCC GGGTGAAGAG CGTGCGGGAG 2 4 60 
GCGGCCGAGC GCATGGCCTT CAACATGCCC GTCCAGGGCA CCGCCGCCGA CCTCATGAAG 2520 
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CTGGCTATGG TGAAGCTCTT CCCCAGGCTG GAGGAAATGG GGGCCAGGAT GCTCCTTCAG 2580 

GTCCACGACG AGCTGGTCCT CGAGGCCCCA AAAGAGAGGG CGGAGGCCGT GGCCCGGCTG 2640 

GCCAAGGAGG TCATGGAGGG GGTGTATCCC CTGGCCGTGC CCCTGGAGGT GGAGGTGGGG 2700 

ATAGGGGAGG ACTGGCTCTC CGCCAAGGAG TGA 2733 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 2733 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: genomic DNA 
' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATGAGGGGCT CGCATCACCA TCACCATCAC GCTGCTGACG ATGACGATAA AATGAGGGGC 60 
ATGCTACCGC TATTTGAGCC CAAGGGCCGG GTCCTCCTGG TCGACGGCCA CCACCTGGCC 120 
TACCGCACCT TCCACGCCCT GAAGGGCCTC ACCACCAGCC GGGGGGAGCC GGTGCAGGCG 180 
GTCTACGGCT TCGCCAAGAG CCTCCTCAAG GCCCTCAAGG AGGACGGGGA CGCGGTGATC 240 
GTGGTCTTTG ACGCCAAGGC CCCCTCCTTC CGCCACGAGG CCTACGGGGG GTACAAGGCG 300 
GGCCGGGCCC CCACGCCGGA GGACTTTCCC CGGCAACTCG CCCTCATCAA GGAGCTGGTG 360 
GACCTCCTGG GGCTGGCGCG CCTCGAGGTC CCGGGCTACG AGGCGGACGA CGTCCTGGCC 420 
AGCCTGGCCA AGAAGGCGGA AAAGGAGGGC TACGAGGTCC GCATCCTCAC CGCCGACAAA 480 
GACCTTTACC AGCTCCTTTC CGACCGCATC CACGTCCTCC ACCCCGAGGG GTACCTCATC 540 
ACCCCGGCCT GGCTTTGGGA AAAGTACGGC CTGAGGCCCG ACCAGTGGGC CGACTACCGG 600 
GCCCTGACCG GGGACGAGTC CGACAACCTT CCCGGGGTCA AGGGCATCGG GGAGAAGACG 660 
GCGAGGAAGC TTCTGGAGGA GTGGGGGAGC CTGGAAGCCC TCCTCAAGAA CCTGGACCGG 720 
CTGAAGCCCG CCATCCGGGA GAAGATCCTG GCCCACATGG ACGATCTGAA GCTCTCCTGG 780 
GACCTGGCCA AGGTGCGCAC CGACCTGCCC CTGGAGGTGG ACTTCGCCAA AAGGCGGGAG 840 
CCCGACCGGG AGAGGCTTAG GGCCTTTCTG GAGAGGCTTG AGTTTGGCAG CCTCCTCCAC 900 
GAGTTCGGCC TTCTGGAAAG CCCCTATGAC AACTACGTCA CCATCCTTGA TGAAGAAACA 960 
CTGAAAGCGT GGATTGCGAA GCTGGAAAAA GCGCCGGTAT TTGCATTTGA TACCGAAACC 1020 
GACAGCCTTG ATAACATCTC TGCTAACCTG GTCGGGCTTT CTTTTGCTAT CGAGCCAGGC 1080 
GTAGCGGCAT ATATTCCGGT TGCTCATGAT TATCTTGATG CGCCCGATCA AATCTCTCGC 1140 
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GAGCGTGCAC TCG£^JfecT AAAACCGCTG CTGGAAGATG ^^VGGCGCT GAAGGTCGGG 1200 
CAAAACCTGA AATACGATCG CGGTATTCTG GCGAACTACG GCATTGAACT GCGTGGGATT 1260 
GCGTTTGATA CCATGCTGGA GTCCTACATT CTCAATAGCG TTGCCGGGCG TCACGATATG 1320 
GACAGCCTCG CGGAACGTTG GTTGAAGCAC AAAACC AT C A CTTTTGAAGA GATTGCTGGT 1380 
AAAGGCAAAA ATCAACTGAC CTTTAACCAG ATTGCCCTCG AAGAAGCCGG ACGTTACGCC 1440 
GCCGAAGATG CAGATGTCAC CTTGCAGTTG CATCTGAAAA TGTGGCCGGA TCTGCAAAAA 1500 
CACAAAGGGC CGTTGAACGT CTTCGAGAAT ATCGAAATGC CGCTGGTGCC GGTGCTTTCA 1560 
CGCATTGAAC GTAACGGTGT GCGCCTGGAC GTGGCCTATC TCAGGGCCTT GTCCCTGGAG 1620 
GTGGCCGAGG AGATCGCCCG CCTCGAGGCC GAGGTCTTCC GCCTGGCCGG CCACCCCTTC 1680 
AACCTCAACT CCCGGGACCA GCTGGAAAGG GTCCTCTTTG ACGAGCTAGG GCTTCCCGCC 17 40 
ATCGGCAAGA CGGAGAAGAC CGGCAAGCGC TCCACCAGCG CCGCCGTCCT GGAGGCCCTC 1800 
CGCGAGGCCC ACCCCATCGT GGAGAAGATC CTGCAGTACC GGGAGCTCAC CAAGCTGAAG 18 60 
AGCACCTACA TTGACCCCTT GCCGGACCTC ATCCACCCCA GGACGGGCCG CCTCCACACC 1920 
CGCTTCAACC AGACGGCCAC GGCCACGGGC AGGCTAAGTA GCTCCGATCC CAACCTCCAG 1980 
AACATCCCCG TCCGCACCCC GCTTGGGCAG AGGATCCGCC GGGCCTTCAT CGCCGAGGAG 2040 
GGGTGGCTAT TGGTGGCCCT GGACTATAGC CAGATAGAGC TCAGGGTGCT GGCCCACCTC 2100 
TCCGGCGACG AGAACCTGAT CCGGGTCTTC CAGGAGGGGC GGGACATCCA CACGGAGACC 2160 
GCCAGCTGGA TGTTCGGCGT CCCCCGGGAG GCCGTGGACC CCCTGATGCG CCGGGCGGCC 2220 
AAGACCATCA ACTTCGGGGT CCTCTACGGC ATGTCGGCCC ACCGCCTCTC CCAGGAGCTA 2280 
GCCATCCCTT ACGAGGAGGC CCAGGCCTTC ATTGAGCGCT ACTTTCAGAG CTTCCCCAAG 234 0 
GTGCGGGCCT GGATTGAGAA GACCCTGGAG GAGGGCAGGA GGCGGGGGTA CGTGGAGACC 2400 
CTCTTCGGCC GCCGCCGCTA CGTGCCAGAC CTAGAGGCCC GGGTGAAGAG CGTGCGGGAG 2460 
GCGGCCGAGC GCATGGCCTT CAACATGCCC GTCCAGGGCA CCGCCGCCGA CCTCATGAAG 2520 
CTGGCTATGG TGAAGCTCTT CCCCAGGCTG GAGGAAATGG GGGCCAGGAT GCTCCTTCAG 2 58 0 
GTCCACGACG AGCTGGTCCT CGAGGCCCCA AAAGAGAGGG CGGAGGCCGT GGCCCGGCTG 2 64 0 
GCCAAGGAGG TCATGGAGGG GGTGTATCCC CTGGCCGTGC CCCTGGAGGT GGAGGTGGGG 2700 
ATAGGGGAGG ACTGGCTCTC CGCCAAGGAG TGA 2733 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2727 base pairs 

(B) Type: nucleotide 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATGAGGGGCT CGCATCACCA TCACCATCAC GCTGCTGACG ATGACGATAA AATGAGGGGC 60 
ATGCTACCGC TATTTGAGCC CAAGGGCCGG GTCCTCCTGG TCGACGGCCA CCACCTGGCC 120 
TACCGCACCT TCCACGCCCT GAAGGGCCTC ACCACCAGCC GGGGGGAGCC GGTGCAGGCG 180 
GTCTACGGCT TCGCCAAGAG CCTCCTCAAG GCCCTCAAGG AGGACGGGGA CGCGGTGATC 240 
GTGGTCTTTG ACGCCAAGGC CCCCTCCTTC CGCCACGAGG CCTACGGGGG GTACAAGGCG 300 
GGCCGGGCCC CCACGCCGGA GGACTTTCCC CGGCAACTCG CCCTCATCAA GGAGCTGGTG 360 
GACCTCCTGG GGCTGGCGCG CCTCGAGGTC CCGGGCTACG AGGCGGACGA CGTCCTGGCC 420 
AGCCTGGCCA AGAAGGCGGA AAAGGAGGGC TACGAGGTCC GCATCCTCAC CGCCGACAAA 480 
GACCTTTACC AGCTCCTTTC CGACCGCATC CACGTCCTCC ACCCCGAGGG GTACCTCATC 540 
ACCCCGGCCT GGCTTTGGGA AAAGTACGGC CTGAGGCCCG ACCAGTGGGC CGACTACCGG 600 
GCCCTGACCG GGGACGAGTC CGACAACCTT CCCGGGGTCA AGGGCATCGG GGAGAAGACG 660 
GCGAGGAAGC TTCTGGAGGA GTGGGGGAGC CTGGAAGCCC TCCTCA^GAA CCTGGACCGG 7 20 
CTGAAGCCCG CCATCCGGGA GAAGATCCTG GCCCACATGG ACGATCTGAA GCTCTCCTGG 780 
GACCTGGCCA AGGTGCGCAC CGACCTGCCC CTGGAGGTGG ACTTCGCCAA AAGGCGGGAG 840 
CCCGACCGGG AGAGGCTTAG GGCCTTTCTG GAGAGGCTTG AGTTTGGCAG CCTCCTCCAC 900 
GAGTTCGGCC TTCTGGAAAG CCCCCCCGTT GGATACAGAA TAGTGAAAGA CCTGGTGGAA 960 
TTTGAAAAAC T CAT AG AG AA AC T G AG AG AA TCCCCTTCGT TCGCCATAGA TCTTGAGACG 1020 
TCTTCCCTCG ATCCTTTCGA CTGCGACATT GTCGGTATCT CTGTGTCTTT CAAACCAAAG 1080 
GAAGCGTACT ACATACCACT CC AT CAT AG A AACGCCCAGA ACCTGGATGA AAAAGAAGTT 1140 
CTGAAAAAGC TAAAAGAAAT CCTGGAGGAC CCCGGAGCAA AGATCGTTGG TCAGAATTTG 1200 
AAATTCGATT ACAAGGTGTT GATGGTAAAG GGTGTTGAAC CTGTCCCTCC TCACTTCGAC 1260 
ACGATGATAG CGGCTTACCT TCTTGAGCCG AACGAAAAGA AGTTCAATCT GGACGATCTC 1320 
GCATTGAAAT TTCTTGGATA CAAAATGACC TCTTACCAGG AACTCATGTC CTTCTCTTCT 1380 
CCGCTGTTTG GTTTCAGTTT TGCCGATGTT CCTGTAGAAA AAGCAGCGAA CTATTCCTGT 144 0 
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GAAGATGCCG ACATCACCTA CAGACTCTAC AAGATCCTGA GCTTAAAACT CCACGAGGAG 15 00 
AGGCTCCTTT GGCTTTACCG GGAGGTGGAG AGGCCCCTTT CCGCTGTCCT GGCCCACATG 15 60 
GAGGCCACGG GGGTGCGCCT GGACGTGGCC TATCTCAGGG CCTTGTCCCT GGAGGTGGCC 1620 
GAGGAGATCG CCCGCCTCGA GGCCGAGGTC TTCCGCCTGG CCGGCCACCC CTTCAACCTC 1680 
AACTCCCGGG ACCAGCTGGA AAGGGTCCTC TTTGACGAGC TAGGGCTTCC CGCCATCGGC 17 40 
AAGACGGAGA AGACCGGCAA GCGCTCCACC AGCGCCGCCG TCCTGGAGGC CCTCCGCGAG 1800 
GCCCACCCCA TCGTGGAGAA GATCCTGCAG TACCGGGAGC TCACCAAGCT GAAGAGCACC 18 60 
TACATTGACC CCTTGCCGGA CCTCATCCAC CCCAGGACGG GCCGCCTCCA CACCCGCTTC 1920 
AACCAGACGG CCACGGCCAC GGGCAGGCTA AGTAGCTCCG ATCCCAACCT CCAGAACATC 1980 
CCCGTCCGCA CCCCGCTTGG GCAGAGGATC CGCCGGGCCT TCATCGCCGA GGAGGGGTGG 2040 
CTATTGGTGG CCCTGGACTA TAGCCAGATA GAGCTCAGGG TGCTGGCCCA CCTCTCCGGC 2100 
GACGAGAACC TGATCCGGGT CTTCCAGGAG GGGCGGGACA TCCACACGGA GACCGCCAGC 2160 
TGGATGTTCG GCGTCCCCCG GGAGGCCGTG GACCCCCTGA TGCGCCGGGC GGCCAAGACC 2220 
ATCAACTTCG GGGTCCTCTA CGGCATGTCG GCCCACCGCC TCTCCCAGGA GCTAGCCATC 22 80 
CCTTACGAGG AGGCCCAGGC CTTCATTGAG CGCTACTTTC AGAGCTTCCC CAAGGTGCGG 2340 
GCCTGGATTG AGAAGACCCT GGAGGAGGGC AGGAGGCGGG GGTACGTGGA GACCCTCTTC 24 00 
GGCCGCCGCC GCTACGTGCC AGACCTAGAG GCCCGGGTGA AGAGCGTGCG GGAGGCGGCC 24 60 
GAGCGCATGG CCTTCAACAT GCCCGTCCAG GGCACCGCCG CCGACCJCAT GAAGCTGGCT 2520 
ATGGTGAAGC TCTTCCCCAG GCTGGAGGAA ATGGGGGCCA GGATGCTCCT TCAGGTCCAC 25 80 
GACGAGCTGG TCCTGGAGGC CCCAAAAGAG AGGGCGGAGG CCGTGGCCCG GCTGGCCAAG 2 640 
GAGGTCATGG AGGGGGTGTA TCCCCTGGCC GTGCCCCTGG AGGTGGAGGT GGGGATAGGG 27 00 
GAGGACTGGC TCTCCGCCAA GGAGTGA 2727 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2727 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ATGAGGGGCT CGCATCACCA TCACCATCAC GCTGCTGACG ATGACGATAA AATGAGGGGC 60 
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ATGbf'kcCGC TAT^^GCC CAAGGGCCGG GTCCTCCTGG ^^ACGGCCA CCACCTGGCC 120 
TACCGCACCT TCCACGCCCT GAAGGGCCTC ACCACCAGCC GGGGGGAGCC GGTGCAGGCG 180 
GTCTACGGCT TCGCCAAGAG CCTCCTCAAG GCCCTCAAGG AGGACGGGGA CGCGGTGATC 240 
GTGGTCTTTG ACGCCAAGGC CCCCTCCTTC CGCCACGAGG CCTACGGGGG GTACAAGGCG 300 
GGCCGGGCCC CCACGCCGGA GGACTTTCCC CGGCAACTCG CCCTCATCAA GGAGCTGGTG 360 
GACCTCCTGG GGCTGGCGCG CCTCGAGGTC CCGGGCTACG AGGCGGACGA CGTCCTGGCC 420 
AGCCTGGCCA AGAAGGCGGA AAAGGAGGGC TACGAGGTCC GCATCCTCAC CGCCGACAAA 480 
GACCTTTACC AGCTCCTTTC CGACCGCATC CACGTCCTCC ACCCCGAGGG GTACCTCATC 540 
ACCCCGGCCT GGCTTTGGGA AAAGTACGGC CTGAGGCCCG ACCAGTGGGC CGACTACCGG 600 
GCCCTGACCG GGGACGAGTC CGACAACCTT CCCGGGGTCA AGGGCATCGG GGAGAAGACG 660 
GCGAGGAAGC TTCTGGAGGA GTGGGGGAGC CTGGAAGCCC TCCTCAAGAA CCTGGACCGG 720 
CTGAAGCCCG CCATCCGGGA GAAGATCCTG GCCCACATGG ACGATCTGAA GCTCTCCTGG 780 
GACCTGGCCA AGGTGCGCAC CGACCTGCCC CTGGAGGTGG ACTTCGCCAA AAGGCGGGAG 840 
CCCGACCGGG AGAGGCTTAG GGCCTTTCTG GAGAGGCTTG AGTTTGGCAG CCTCCTCCAC 900 
GAGTTCGGCC TTCTGGAAAG CCCCCCCGTT GGATACAGAA TAGTGAAAGA CCTGGTGGAA 960 
TTTGAAAAAC TCATAGAGAA ACTGAGAGAA TCCCCTTCGT TCGCCATAGA TCTTGAGACG 1020 
TCTTCCCTCG ATCCTTTCGA CTGCGACATT GTCGGTATCT CTGTGTCTTT CAAACCAAAG 1080 
GAAGCGTACT ACATACCACT CCATCATAGA AACGCCCAGA ACCTGGATGA AAAAGAAGTT 1140 
CTGAAAAAGC TAAAAGAAAT CCTGGAGGAC CCCGGAGCAA AGATCGTTGG TCAGAATTTG 1200 
AAATTCGATT ACAAGGTGTT GATGGTAAAG GGTGTTGAAC CTGTCCCTCC TCACTTCGAC 1260 
ACGATGATAG CGGCTTACCT TCTTGAGCCG AACGAAAAGA AGTTCAATCT GGACGATCTC 1320 
GCATTGAAAT TTCTTGGATA CAAAATGACC TCTTACCAGG AACTCATGTC CTTCTCTTCT 1380 
CCGCTGTTTG GTTTCAGTTT TGCCGATGTT CCTGTAGAAA AAGCAGCGAA CTATTCCTGT 1440 
GAAGATGCAG ACATCACCTA CAGACTCTAC AAGATCCTGA GCTTAAAACT CCACGAGGCA 1500 
GATCTGGAGA ACGTGTTCTA CAAGATAGAA ATGCCTCTTG TGAGCGTGCT TGCACGGATG 1560 
GAACTGAACG GTGTGCGCCT GGACGTGGCC TATCTCAGGG CCTTGTCCCT GGAGGTGGCC 1620 
GAGGAGATCG CCCGCCTCGA GGCCGAGGTC TTCCGCCTGG CCGGCCACCC CTTCAACCTC 1680 
AACTCCCGGG ACCAGCTGGA AAGGGTCCTC TTTGACGAGC TAGGGCTTCC CGCCATCGGC 1740 
AAGACGGAGA AGACCGGCAA GCGCTCTACC AGCGCCGCCG TCCTGGAGGC CCTCCGCGAG 1800 
GCCCACCCCA TCGTGGAGAA GATCCTGCAG TACCGGGAGC TCACCAAGCT GAAGAGCACC 1860 
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TACATTGACC 



CCT 



1GA 



CCTCATCCAC CCCAGGACGG 



tCCTCCA CACCCGCTTC 1920 



AACCAGACGG CCACGGCCAC GGGCAGGCTA AGTAGCTCCG ATCCCAACCT CCAGAACATC 1980 



CTATTGGTGG CCCTGGACTA TAGCCAGATA GAGCTCAGGG TGCTGGCCCA CCTCTCCGGC 2100 
GACGAGAACC TGATCCGGGT CTTCCAGGAG GGGCGGGACA TCCACACGGA GACCGCCAGC 2160 
TGGATGTTCG GCGTCCCCCG GGAGGCCGTG GACCCCCTGA TGCGCCGGGC GGCCAAGACC 2220 
ATCAACTTCG GGGTCCTCTA CGGCATGTCG GCCCACCGCC TCTCCCAGGA GCTAGCCATC 2280 
CCTTACGAGG AGGCCCAGGC CTTCATTGAG CGCTACTTTC AGAGCTTCCC CAAGGTGCGG 2340 
GCCTGGATTG AGAAGACCCT GGAGGAGGGC AGGAGGCGGG GGTACGTGGA GACCCTCTTC 2400 
GGCCGCCGCC GCTACGTGCC AGACCTAGAG GCCCGGGTGA AGAGCGTGCG GGAGGCGGCC 24 60 
GAGCGCATGG CCTTCAACAT GCCCGTCCAG GGCACCGCCG CCGACCTCAT GAAGCTGGCT 2520 
ATGGTGAAGC TCTTCCCCAG GCTGGAGGAA ATGGGGGCCA GGATGCTCCT TCAGGTCCAC 2580 
GACGAGCTGG TCCTCGAGGC CCCAAAAGAG AGGGCGGAGG CCGTGGCCCG GCTGGCCAAG 2640 
GAGGTCATGG AGGGGGTGTA TCCCCTGGCC GTGCCCCTGG AGGTGGAGGT GGGGATAGGG 27 00 
GAGGACTGGC TCTCCGCCAA GGAGTGA 2727 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2850 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULS : genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATGAGGGGCT CGCATCACCA TCACCATCAC GCTGCTGACG ATGACGATAA . AATGAGGGGC 60 

ATGCTACCGC TATTTGAGCC CAAGGGCCGG GTCCTCCTGG TCGACGGCCA CCACCTGGCC 120 

TACCGCACCT TCCACGCCCT GAAGGGCCTC ACCACCAGCC GGGGGGAGCC GGTGCAGGCG 180 

GTCTACGGCT TCGCCAAGAG CCTCCTCAAG GCCCTCAAGG AGGACGGGGA CGCGGTGATC 2 40 

GTGGTCTTTG ACGCCAAGGC CCCCTCCTTC CGCCACGAGG CCTACGGGGG GTACAAGGCG 300 

GGCCGGGCCC CCACGCCGGA GGACTTTCCC CGGCAACTCG CCCTCATCAA GGAGCTGGTG 3 60 

GACCTCCTGG GGCTGGCGCG CCTCGAGGTC CCGGGCTACG AGGCGGACGA CGTCCTGGCC 420 

AGCCTGGCCA AGAAGGCGGA AAAGGAGGGC TACGAGGTCC GCATCCTCAC CGCCGACAAA 4 80 

GACCTTTACC AGCTCCTTTC CGACCGCATC CACGTCCTCC ACCCCGAGGG GTACCTCATC 5 40 



CCCGTCCGCA 



CCCCGCTTGG 



GCAGAGGATC CGCCGGGCCT TCATCGCCGA GGAGGGGTGG 2040 
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ACCCCGGCCT 


GGCTTTGGGA 


AAAGTAPnr;p 


v^ 1 vji-ivjvjv^v_,v_,vj 


AOCAvjl vjvjGC 


CGACTACCGG 


600 


GCCCTGACCG 


GGGACGAGTC 


CGAPAAfrTT 


v^v_,v^vjvjoo1 v^A 


AVjVJVjOAl CGG 


GGAGAAGACG 


660 


GGGAGGAAGC 


TTCTGGAGGA 


GTGGGGGAGP 


V-' 1 VJVJ/-U-1VJV_,V J .V^ 


1 CO 1 OAAGAA 


CCTGGACCGG 


720 


CTGAAGCCCG 


CCATCCGGGA 


GAAGATPPTft 


nppp APzvTnn: 


fiOvjAl O 1 vjAA 


GCTCTCCTGG 


780 


GACCTGGCCA 


AGGTGCGCAC 


CGACCTGCPP 


V^ 1 OO/ivjO 1 VJVJ 


lVP'T r Por , p , r i 7\7\ 
AO 1 1 OvjCCAA 


AAGGCGGGAG 


840 


CCCGACCGGG 


AGAGGCTTAG 


GGCCTTTCTG 


onorloo\r lib 


-rlo ill vjvjC Avj 


CCTCCTCCAC 


900 


GAGTTCGGCC 


TTCTGGAAAG 


CCCCCATCCA 


r;p Af^TTr^Tpr: 


7A P IV T C* T 1 T 1 P* P* 7A 
■fiOAl Oil OtjA 


TV m TV tv m ti mm 

ATACGATATT 


960 


CCATTTGCAA 


AGAGATACCT 


CATCGACAAA 


nnPPTA AT HP 
vjvjv^O X nnl AO 


P IV IV TTP 7\ f~TT* 
OAA1 OvjAOvjCd 


r~+ /—» TV T\ TV TV J*"t TV ^» 

GGAAGAAGAG 


1020 


CTAAAGATTC 


TTGCCTTCGA 


TATAGAAACC 


PTPTATP APH 
^/i\/lnl v^2-V\>-vj 


Z\ ZVP^f^ZVP' 7A 7V/^7A 


^Tprnm tv tv tv tv 

GTTTGGAAAA 


1080 


GGCCCAATTA 


TAATGATTAG 


TTATGCAGAT 


P A A A ZA Tit: IV 7A (Z 
^r\Jr\r\±\l vjj-lrivj 




TACT TGG AAA 


1140 


AACATAGATC 


TTCCATACGT 


TGAGGTTGTA 

x ufiwu x x vj x n 




vjiivjAbAI GAT 


AAAGAGATTT 


1200 


CTCAGGATTA 


TCAGGGAGAA 


GGATPPTf^AP 


Z\TTIVTIVr ,rnrn 7A 
ri 1 1 A 1 Avj 1 1/i 


CI 1A1AATGG 


AGACTCATTC 


1260 


GACTTCCCAT 


ATTTAGCGAA 


AAGGGPAGAA 


A A APTTPPPIV 
ri/irlv_, 1 1 vjvjvjA 


r P r P7\7\7\ r nm7\ t\ r~* 
1 1 KHJ\ 1 1 AAC 


CATTGGAAGA 


1320 


GATGGAAGCG 


AGCCCAAGAT 


GCAGAGAATA 


f^PJPPlAT ATPZV 


OLjvjU 1 vj 1 ACjA 


AGTCAAGGGA 


1380 


AGAATACATT 


TCGACTTGTA 


TCATGTAATA 

x i \ x \j x lui x n 


AP A AC^ZiP A A 


liAAAl U 1 LLt 


AACATACACA 


1440 


CTAGAGGCTG 


TATATGAAGC 


AATTTTTftHA 

'"i X X X X X vj Vj^i 


A APPP A A APP 
rlravjv^. Orlrvrlvj vj 


AvjAACjCj 1 ATA 


CGCCGACGAG 


1500 


ATAGCAAAAG 


CCTGGGAAAG 


TPinAnAr^zxzip 


1^1 1 vj Avj Avj Avj 


rp m f* f~~i i~> H Tl T\ m TV 

ITGCCAAATA 


CTCGATGGAA 


1560 


GATGCAAAGG 


CAACTTATGA 


ACTPGf}f;AAA 


o/\ril 1 v_,0 1 1L* 


P^7\ 7\ r Tir~*r* , 7\ tv Tvm 
UAA1 VJVjr ( AAAT 


TCAGCTTTCA 


1620 


GAGAGGCTCC 


TTTGGCTTTA 


CCGGGAGGTG 


HAPAPPPPPP 


1 1 1 L-Ovjv^ i<j i 


CCIGGCCCAC 


1680 


ATGGAGGCCA 


CGGGGGTGCG 


CCTGGACGTG 


PPPTATPTP A 


vjvjvjv_,L* 1 1 VJ 1 U 


CCTGGAGGTG 


i "i a rv 

17 4 0 


GCCGAGGAGA 


TCGCCCGCCT 


CGAGGPPGAG 


ftTPTTPPPPP 
valval ILLbLL 


1 vjvjOOvjvjCCA 


CCCCTTCAAC 


1800 


CTCAACTCCC 


GGGACCAGCT 


GGAAAPIf^TP 




T\ pprp tv r~* /-* rn 

ACjU 1 AGGGCT 


TCCCGCCATC 


1860 


GGCAAGACGG 


AGAAGACCGG 


v/nnouou x v^v^ 


zv P r* zv rp r*r*r* c 


CCGTCCTGGA 


GGCCCTCCGC 


1920 


GAGGCCCACC 


CCATCGTGGA 


GAAGATPPTf, 


p zv rzT El p p r*pp 


AvjU I U ACCAA 


f~*TT\ f* TV TV /"t TV 

GCTGAAGAGC 


1980 


ACCTACATTG 


ACCCCTTGCC 


GGAPPTPATP 

uunuv x X Vw 


p zv pp pp i\ r* r* tv 


UvjvjvjCCGCCT 


CCACACCCGC 


2040 


TTCAACCAGA 


CGGCCACGGC 


CACGGGCAGG 


PTZvzvnTivnr'T 

v^ J. rlf-ivj 1 noL/ 1 


r ,, P*P*7\ r P/™«/ w '/* ,, 7V TV 

OUvjAI cccaa 


/~<fTl/^ TV f~* TV TV *~l 

CCTCCAGAAC 


2100 


ATCCCCGTCC 


GCACCCCGCT 


TGGGCAGAGG 


ATPP CHHCHC 


bbl 1 CA1 CGC 


CGAGGAGGGG 


2160 


TGGCTATTGG 


TGGCCCTGGA 


CTATAGCCAG 


ATAGAGCTCA 


GGGTGCTGGC 


CCACCTCTCC 


2220 


GGCGACGAGA 


ACCTGATCCG 


GGTCTTCCAG 


GAGGGGCGGG 


ACATCCACAC 


GGAGACCGCC 


2280 


AGCTGGATGT 


TCGGCGTCCC 


CCGGGAGGCC 


GTGGACCCCC 


TGATGCGCCG 


GGCGGCCAAG 


2340 
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ACCATCAACT TCGGGGTCCT CTACGGCATG TCGGCCCACC GCCTCTCCCA GGAGCTAGCC 2400 

ATCCCTTACG AGGAGGCCCA GGCCTTCATT GAGCGCTACT TTCAGAGCTT CCCCAAGGTG 24 60 

CGGGCCTGGA TTGAGAAGAC CCTGGAGGAG GGCAGGAGGC GGGGGTACGT GGAGACCCTC 2520 

TTCGGCCGCC GCCGCTACGT GCCAGACCTA GAGGCCCGGG TGAAGAGCGT GCGGGAGGCG 2580 

GCCGAGCGCA TGGCCTTCAA CATGCCCGTC CAGGGCACCG CCGCCGACCT CATGAAGCTG 2640 

GCTATGGTGA AGCTCTTCCC CAGGCTGGAG GAAATGGGGG CCAGGATGCT CCTTCAGGTC 2700 

CACGACGAGC TGGTCCTCGA GGCCCCAAAA GAGAGGGCGG AGGCCGTGGC CCGGCTGGCC 27 60 

AAGGAGGTCA TGGAGGGGGT GTATCCCCTG GCCGTGCCCC TGGAGGTGGA GGTGGGGATA 2820 

GGGGAGGACT GGCTCTCCGC CAAGGAGTGA 2850 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2949 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
ATGAGGGGCT CGCATCACCA TCACCATCAC GCTGCTGACG ATGACGATAA AATGAGGGGC 60 
ATGCTACCGC TATTTGAGCC CAAGGGCCGG GTCCTCCTGG TCGACQGCCA CCACCTGGCC 120 



GTCTACGGCT TCGCCAAGAG CCTCCTCAAG GCCCTCAAGG AGGACGGGGA CGCGGTGATC 24 0 

GTGGTCTTTG ACGCCAAGGC CCCCTCCTTC CGCCACGAGG CCTACGGGGG GTACAAGGCG 300 

GGCCGGGCCC CCACGCCGGA GGACTTTCCC CGGCAACTCG CCCTCATCAA GGAGCTGGTG 3 60 

GACCTCCTGG GGCTGGCGCG CCTCGAGGTC CCGGGCTACG AGGCGGACGA CGTCCTGGCC 4 20 

AGCCTGGCCA AGAAGGCGGA AAAGGAGGGC TACGAGGTCC GCATCCTCAC CGCOGACAAA 480 

GACCTTTACC AGCTCCTTTC CGACCGCATC CACGTCCTCC ACCCCGAGGG GTACCTCATC 540 

ACCCCGGCCT GGCTTTGGGA AAAGTACGGC CTGAGGCCCG ACCAGTGGGC CGACTACCGG 600 

GCCCTGACCG GGGACGAGTC CGACAACCTT CCCGGGGTCA AGGGCATCGG GGAGAAGACG 660 

GCGAGGAAGC TTCTGGAGGA GTGGGGGAGC CTGGAAGCCC TCCTCAAGAA CCTGGACCGG 720 

CTGAAGCCCG CCATCCGGGA GAAGATCCTG GCCCACATGG ACGATCTGAA GCTCTCCTGG 7 80 

GACCTGGCCA AGGTGCGCAC CGACCTGCCC CTGGAGGTGG ACTTCGCCAA AAGGCGGGAG 8 40 



TACCGCACCT 



TCCACGCCCT 



GAAGGGCCTC 



ACCACCAGCC GGGGGGAGCC GGTGCAGGCG 



180 




CCCGACCGGG AGAGGCTTAG GGCCTTTCTG 
GAGTTCGGCC TTCTGGAAAG CCCCGTTAGA 
. TACGATATTC CATTTGCAAA GAGATACCTC 
GAAGAAGAGC TAAAGATTCT TGCCTTCGAT 
TTTGGAAAAG GCCCAATTAT AATGATTAGT 
ACTTGGAAAA ACATAGATCT TCCATACGTT 
AAGAGATTTC TCAGGATTAT CAGGGAGAAG 
GACTCATTCG ACTTCCCATA TTTAGCGAAA 
ATTGGAAGAG ATGGAAGCGA GCCCAAGATG 
GTCAAGGGAA GAATACATTT CGACTTGTAT 
ACATACACAC TAGAGGCTGT ATATGAAGCA 

O 

gccgacgaga tagcaaaagc ctgggaaagt 

jjj TCGATGGAAG ATGCAAAGGC AACTTATGAA 

CAGCTTTCAA GATTAGTTGG ACAACCTTTA 

I |i i 

\U CTTGTAGAGT GGTTCTTACT TAGGAAAGCC 

^ CCAAGTGAAG AGGAGTATCA AAGAAGGCTC 

Q 

|y ctggacgtgg cctatctcag ggccttgtcc 

m gaggccgagg tcttccgcct ggccggccac 

jt gaaagggtcc tctttgacga gctagggctt 

aagcgctcca ccagcgccgc cgtcctggag 
aagatcctgc agtaccggga gctcaccaag 
gacctcatcc accccaggac gggccgcctc 
acgggcaggc taagtagctc cgatcccaac 
gggcagagga tccgccgggc cttcatcgcc 
tatagccaga tagagctcag ggtgctggcc 
gtcttccagg aggggcggga catccacacg 
cgggaggccg tggaccccct gatgcgccgg 
tacggcatgt: cggcccaccg cctctcccag 
gccttcattg agcgctactt tcagagcttc 
ctggaggagg gcaggaggcg ggggtacgtg 



09/623326 

# 

GAGAGGCTTG AGTTTGGCAG CCTCCTCCAC 900 
GAACATCCAG CAGTTGTGGA CATCTTCGAA 960 
ATCGACAAAG GCCTAATACC AATGGAGGGG 1020 
ATAGAAACCC TCTATCACGA AGGAGAAGAG 1080 
TATGCAGATG AAAATGAAGC AAAGGTGATT 1140 
GAGGTTGTAT CAAGCGAGAG AGAGATGATA 1200 
GATCCTGACA TTATAGTTAC TTATAATGGA 1260 
AGGGCAGAAA AACTTGGGAT TAAATTAACC 1320 
CAGAGAATAG GCGATATGAC GGCTGTAGAA 1380 
CATGTAATAA CAAGGACAAT AAATCTCCCA 1440 
ATTTTTGGAA AGCCAAAGGA GAAGGTATAC 1500 
GGAGAGAACC TTGAGAGAGT TGCCAAATAC 15 60 
CTCGGGAAAG AATTCCTTCC AATGGAAATT 1620 
TGGGATGTTT CAAGGTCAAG CACAGGGAAC 1680 
TACGAAAGAA ACGAAGTAGC TCCAAACAAG 1740 
AGGGAGAGCT ACACAGGTGG ATTCGTGCGC 1800 
CTGGAGGTGG CCGAGGAGAT CGCCCGCCTC 18 60 
CCCTTCAACC TCAACTCCCG GGACCAGCTG 1920 
CCCGCCATCG GCAAGACGGA GAAGACCGGC 1980 
GCCCTCCGCG AGGCCCACCC CATCGTGGAG 2040 
CTGAAGAGCA CCTACATTGA CCCCTTGCCG 2100 
CACACCCGCT TCAACCAGAC GGCCACGGCC 2160 
CTCCAGAACA TCCCCGTCCG CACCCCGCTT 2220 
GAGGAGGGGT GGCTATTGGT GGCCCTGGAC 2280 
CACCTCTCCG GCGACGAGAA CCTGATCCGG 2340 
GAGACCGCCA GCTGGATGTT CGGCGTCCCC 2400 
GCGGCCAAGA CCATCAACTT CGGGGTCCTC 2460 
GAGCTAGCCA TCCCTTACGA GGAGGCCCAG 2520 
CCCAAGGTGC GGGCCTGGAT TGAGAAGACC 2580 
GAGACCCTCT TCGGCCGCCG CCGCTACGTG 2640 
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CCAGACCTAG AGGCC^^GGT GAAGAGCGTG CGGGAGGCGG C^^GCGCAT GGCCTTCAAC 27 00 
ATGCCCGTCC AGGGCACCGC CGCCGACCTC ATGAAGCTGG CTATGGTGAA GCTCTTCCCC 27 60 
AGGCTGGAGG AAATGGGGGC CAGGATGCTC CTTCAGGTCC ACGACGAGCT GGTCCTCGAG 2820 
GCCCCAAAAG AGAGGGCGGA GGCCGTGGCC CGGCTGGCCA AGGAGGTCAT GGAGGGGGTG 2880 
TATCCCCTGG CCGTGCCCCT GGAGGTGGAG GTGGGGATAG GGGAGGACTG GCTCTCCGCC 2940 

2949 

AAGGAGTGA 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION:. SEQ ID NO: 7: 

Met Arq Gly Ser His His His His His His Ala Ala Asp Asp Asp Asp 
15 10 I 5 

Lvs Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
20 25 30 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
35 40 45 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
50 55 60 ^ 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 



Lys 

65 70 



75 80 



Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 

85 90 95 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
100 105 -HO 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
115 120 125 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
130 135 140 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
145 150 155 160 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
165 170 175 



Glv Tyr L^^fcle Thr Pro Ala Trp Leu Trp^^u Lys Tyr Gly Leu Arg 

^80 185 190 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
195 200 205 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
210 215 220 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
225 230 235 240 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 



245 



250 



255 



Lvs Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
260 265 270 

Val Asp. Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 



275 



280 



285 



Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
290 295 300 

Leu Glu Ser Pro Tyr Asp Asn Tyr Val Thr He Leu Asp Glu Glu Thr 
305 310 315 320 

Leu Lys Ala Trp He Ala Lys Leu Glu Lys Ala Pro Val Phe Ala Phe 
325 330 335 

Asp Thr Glu Thr Asp Ser Leu Asp Asn He Ser Ala Asn Leu Val Gly 
340 345 350 

Leu Ser Phe Ala He Glu Pro Gly Val Ala Ala Tyr He Pro Val Ala 
355 360 365 

His Asp Tyr Leu Asp Ala Pro Asp Gin He Ser Arg Glu Arg Ala Leu 
370 375 380 

Glu Leu Leu Lys Pro Leu Leu Glu Asp Glu Lys Ala Leu Lys Val Gly 
385 390 395 400 

Gin Asn Leu Lys Tyr Asp Arg Gly lie Leu Ala Asn Tyr Gly He Glu 
405 . 410 415 

Leu Arg Gly He Ala Phe Asp Thr Met Leu Glu Ser Tyr ' He Leu Asn 
420 425 . 430 

Ser Val Ala Gly Arg His Asp Met Asp Ser Leu Ala Glu Arg Trp Leu 
435 440 445 

Lys His Lys Thr He Thr Phe Glu Glu He Ala Gly Lys Gly Lys Asn 
450 455 460 

Gin Leu Thr Phe Asn Gin He Ala Leu Glu Glu Ala Gly Arg Tyr Ala 
465 " 470 ■ 475 480 

Ala Glu Asp Ala Asp Val Thr Leu Gin Leu His Leu Lys Met Trp Pro 
485 490 495 
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Tyr Arg Glu Val Glu 
-500 50b 510 



Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly Val Arg 
515 520 525 

Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu 
530 535 540 

Val Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe 
545 550 555 560 

Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu 
565 570 575 

Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser Thr 
580 585 590 

Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val Glu 
595 600 605 

Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr He 
610 615 620 

Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg Leu His Thr 
625 630 635 640 

Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
645 650 655 

Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg He 
660 665 670 

Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp 
675 680 685 

Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu 
690 695 700*. 

Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr Glu Thr 
705 710 715 720 

Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met 
725 730 735 

Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met Ser 
740 745 750 

Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Gin 
755 760 765 

Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp 
770 775 780- 

He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr 
785: 790 : 795 ; 800 

Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys 
805 810 815 
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Ser Val ArJ^Pu Ala Ala Glu Arg Met Ala ^1 Asn Met Pro Val Gin 

820 825 830 

Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro 
835 840 845 

Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val His Asp Glu 
850 855 860 

Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu 
865 870 875 880 

Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu 
885 890 895 

Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Glu 
900 905 910 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Arg Gly Ser His His His His His His Ala Ala Asp Asp Asp Asp 
1 5 10 15 

Lys Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 

20 25 , 30 

-> 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
35 40 45 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
50 55 60 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
65 70 75 * 80 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
85 90 95 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
100 105 HO 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
115 120 125 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
130 135 140 
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Lys Ala Gl^Pys Glu Gly Tyr Glu Val Arg Leu Thr Ala Asp Lys 

145 150 155 160 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 

165 170 175 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
180 185 190 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
195 200 205 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
210 215 220 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
225 230 235 240 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
245 250 255 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
260 265 270 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
275 280 285 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 

290 295 300 

Leu Glu Ser Pro Tyr Asp Asn Tyr Val Thr He Leu Asp Glu Glu Thr 
305 310 315 320 

Leu Lys Ala Trp He Ala Lys Leu Glu Lys Ala Pro Val Phe Ala Phe 
325 330 335 

Asp Thr Glu Thr Asp Ser Leu Asp Asn He Ser Ala Asn Leu Val Gly 
340 345 • 350 

Leu Ser Phe Ala He Glu Pro Gly Val Ala Ala Tyr He Pro Val Ala 
355 360 365 

His Asp Tyr Leu Asp Ala Pro Asp Gin He Ser Arg Glu Arg Ala Leu 
370 375 380 

Glu Leu Leu Lys Pro Leu Leu Glu Asp Glu Lys Ala Leu Lys Val Gly 
385 390 395 400 

Gin Asn Leu Lys Tyr Asp Arg Gly lie Leu Ala Asn Tyr Gly He Glu 
405 410 415 

Leu Arg Gly He Ala Phe Asp Thr Met Leu Glu Ser Tyr He Leu Asn 
420 425 430 

Ser Val Ala Gly Arg His Asp Met Asp Ser Leu Ala Glu Arg Trp Leu 
435 440 445 

Lys His Lys Thr He Thr Phe Glu Glu He Ala Gly Lys Gly Lys Asn 
450 455 460 
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Gin Leu Tl^^he Asn Gin He Ala Leu Glu^^i Ala Gly Arg Tyr Ala 

465 470 475 480 

Ala Glu Asp Ala Asp Val Thr Leu Gin Leu His Leu Lys Met Trp Pro 

485 490 495 

Asp Leu Gin Lys His Lys Gly Pro Leu Asn Val Phe Glu Asn He Glu 
500 505 510 

Met Pro Leu Val Pro Val Leu Ser Arg He Glu Arg Asn Gly Val Arg 
515 520 525 

Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu 
530 535 540 

He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe 
545 550 555 560 

Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu 

565 570 575 

Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser Thr 
580 585 590 

Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val Glu 
5 95 600 605 

Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr He 
610 615 620 

Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg Leu His Thr 
625 630 635 640 

Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 

645 650 655 

• 

Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg He 
660 665 • 670 

Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp 
675 680 685 

Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu 
690 695 700 

Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr Glu Thr 
705 710 715 720 

Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met 
725 730 735 

Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met Ser 
740 745 750 

Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Gin 
: 755 760 ' 765 

Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp 
770 775 780 
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lie Glu Ly^^r Leu Glu Glu Gly Arg Arg Gly Tyr Val Glu Thr 

785 790 795 800 

Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys 
805 810 815 

Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin 
820 825 830 

Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro 
835 840 845 

Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val His Asp Glu 
850 855 860 

Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu 
865 870 875 880 

Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu 
885 890 895 

Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Glu 
900 905 910 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 908 amino acids 
;^ (B) TYPE: amino acid 

Ui (C) STRANDEDNESS : single strand 

■ . (D) TOPOLOGY: linear 

P 

jy (ii) TYPE OF MOLECULE: protein 

IB (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

□ 

M Met Arg Gly Ser His His His His His His Ala AlauAsp Asp Asp Asp 

15 10 15 

Lys Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
20 25 30 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
35 40 45 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
50 55 60 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
65 70 75 80 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
85 90 95 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
100 105 110 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
115 120 125 
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Glu Val Pr^MLy Tyr Glu Ala Asp Asp Val^H Ala Ser Leu Ala Lys 

130 135 ^ 140 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
145 150 155 160 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
165 170 175 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
180 185 190 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
195 200 205 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
210 215 220 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
225 230 235 240 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
245 250 255 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
260 265 270 

Val Asp . Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
275 280 285 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
290 295 300 

Leu Glu Ser Pro Pro Val Gly Tyr Arg lie Val Lys Asp Leu Val Glu 
305 310 315 320 

Phe Glu Lys Leu lie Glu Lys Leu Arg Glu Ser Pro Ser Phe Ala lie 

325 330 \ 335 

Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asp Cys Asp lie Val Gly 
340 345 350 

lie Ser Val Ser Phe Lys Pro Lys Glu Ala Tyr Tyr lie Pro Leu His 
355 360 365 

His Arg Asn Ala Gin Asn Leu Asp Glu Lys Glu Val Leu* Lys Lys Leu 
370 375 380 

Lys Glu lie Leu Glu Asp Pro Gly Ala Lys lie Val Gly Gin Asn Leu 
385 390 395 400 

Lys Phe Asp Tyr Lys Val Leu Met Val Lys Gly Val Glu Pro Val Pro 
405 410 415 

Pro His Phe Asp Thr Met lie Ala Ala Tyr Leu Leu Glu Pro Asn Glu 
420 425 430 



Lys Lys Phe Asn Leu Asp Asp Leu Ala Leu Lys Phe Leu Gly Tyr Lys 
435 440 445 
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Met Thr S^^Byr Gin Glu Leu Met Ser Ph4^Pr Ser Pro Leu Phe Gly 

450 455 460 



Phe Ser Phe Ala Asp Val Pro Val Glu Lys Ala Ala Asn Tyr Ser Cys 
465 470 475 480 

Glu Asp Ala Asp lie Thr Tyr Arg Leu Tyr Lys lie Leu Ser Leu Lys 

485 490 495 

Leu His Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu Arg Pro 
500 505 510 

Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly Val Arg Leu Asp 
515 520 525 

Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu lie Ala 
530 535 540 

Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu 
545 550 555 560 

Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Gly Leu 

565 570 575 

Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser Thr Ser Ala 
580 585 590 

Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys lie 
595 600 605 

Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr lie Asp Pro 
610 615 620 

Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg Leu His Thr Arg Phe 
625 630 635 640 

Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn 

64 5 65 0 '~\ 65 5 

Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie Arg Arg 
660 665 670 

Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser 
675 680 685 

Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp* Glu Asn Leu 
690 695 700 

lie Arg Val Phe Gin Glu Gly Arg Asp lie His Thr Glu Thr Ala Ser 
705 710 715 720 

Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg 

725 730 735 



Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His 
740 745 750 



Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu Ala Gin Ala Phe 
755 760 765 



dj^yr Phe Gin Ser Phe Pro Ly^^ll 
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He Glu Al^Tyr Phe Gin Ser Phe Pro Lys^PTl Arg Ala Trp He Glu 

770 775 780 

Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe 
785 790 795 800 

Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val 
805 810 815 

Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr 
820 825 830 

Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu 
835 840 845 

Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val His Asp Glu Leu Val 
850 855 860 

Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys 
865 870 875 880 

Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu 
885 890 895 

Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Glu 
900 905 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 908 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Arg Gly Ser His His His His His His Ala Ala Asp Asp Asp Asp 
15 10 15 

Lys Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
20 25 .30 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
35 40 45 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
50 55 60 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
65 70 75 80 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
85 90 95 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
100 105 HO 
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Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
115 120 125 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
130 135 140 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
145 150 155 160 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
165 170 175 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
180 185 190 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
195 200 205 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
210 215 220 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
225 230 235 240 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
245 250 255 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
260 265 270 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
275 280 285 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
290 295 3.00 

Leu Glu Ser Pro Pro Val Gly Tyr Arg lie Val Lys* Asp Leu Val Glu 
305 310 315 320 

Phe Glu Lys Leu lie Glu Lys Leu Arg Glu Ser Pro Ser Phe Ala lie 
325 330 335 

Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asp Cys Asp lie Val Gly 
340 345 350 

lie Ser Val Ser Phe Lys Pro Lys Glu Ala Tyr Tyr lie Pro Leu His 
355 360 365 

His Arg Asn Ala Gin Asn Leu Asp Glu Lys Glu Val Leu Lys Lys Leu 
370 375 380 

Lys Glu lie Leu Glu Asp Pro Gly Ala Lys lie Val Gly Gin Asn Leu 
385 390 395 400 

Lys Phe Asp Tyr Lys Val Leu Met -Val Lys Gly Val Glu Pro Val Pro 



405 



410 



415 



Pro His Phe Asp Thr Met lie Ala Ala Tyr Leu Leu Glu Pro Asn Glu 
420 425 430 
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Lys Lys Phe Asn Leu Asp Asp Leu Ala Leu Lys Phe Leu Gly Tyr Lys 
435 440 445 

Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Ser Pro Leu Phe Gly 
450 455 460 

Phe Ser Phe Ala Asp Val Pro Val Glu Lys Ala Ala Asn Tyr Ser Cys 
465 470 475 480 

Glu Asp Ala Asp lie Thr Tyr Arg Leu Tyr Lys lie Leu Ser Leu Lys 
485 490 495 

Leu His Glu Ala Asp Leu Glu Asn Val Phe Tyr Lys lie Glu Met Pro 
500 505 510 

Leu Val Ser Val Leu Ala Arg Met Glu Leu Asn Gly Val Arg Leu Asp 
515 520 525 

Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu lie Ala 
530 535 540 

Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu 
545 550 * 555 560 

Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Gly Leu 
565 570 575 

Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser Thr Ser Ala 
580 585 590 

Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys lie 
595 600 605 

Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr lie Asp Pro 
610 615 620 

Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg Leu- His Thr Arg Phe 
625 630 635 640 

Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn 
645 650 655 

Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie Arg Arg 
660 665 670 

Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val Ala Leu. Asp Tyr Ser 
675 680 685 

Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu 
690 695 700 

He Arg Val Phe Gin Glu Gly Arg Asp He His Thr Glu Thr Ala Ser 
705 710 715 720 

Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg 
725 730 735 

Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His 
740 745 750 
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Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu Ala Gin Ala Phe 
755 760 765 

lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp lie Glu 
770 775 780 

Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe 
785 790 795 800 

Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val 

805 810 815 

Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr 
820 825 830 

Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu 
835 840 845 

Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val His Asp Glu Leu Val 
850 855 860 

Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys 
865 870 875 880 

Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu 

885 890 895 

Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Glu 
900 905 

INFORMATION FORSEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Arg Gly Ser His His His His His His Ala Ala Asp Asp Asp Asp 
1 5 10 .-15 

Lys Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
20 25 30 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
35 40 45 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
50 55 60 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
65 70 75 80 



Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 

85 90 95 
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Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 



100 



105 



110 



Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
115 120 125 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
130 135 140 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
145 150 155 — 



160 



Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
165 170 175 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 



180 



185 



190 



Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
195 200 one; 



205 



Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
210 215 220 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arq 
225 230 235 240 

Leu Lys Pro Ala He Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
245 250 255 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
260 265 270 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
275 280 , 285 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His'. Glu Phe Gly Leu 
290 295 300 

Leu Glu Ser Pro His Pro Ala Val Val Asp He Phe Glu Tyr Asp He 
305 310 315 320 

Pro Phe Ala Lys Arg Tyr Leu He Asp Lys Gly Leu lie. Pro Met Glu 
325 330 335 

Gly Glu Glu Glu Leu Lys He Leu Ala Phe Asp He Glu Thr Leu Tyr 
340 345 350 

His Glu Gly Glu Glu Phe Gly Lys Gly Pro He He Met He Ser Tyr 
355 360 365 

Ala Asp Glu Asn Glu Ala Lys Val He Thr Trp Lys Asn He Asp Leu 
370 375 380 

Pro Tyr Val Glu Val Val Ser Set Glu Arg Glu Met lie: Lys Arg Phe 
385 390 395 400 

Leu Arg He He Arg Glu Lys Asp Pro Asp He He Val Thr Tyr Asn 
405 410 415 



# 
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Gly Asp Ser Phe Asp Phe Pro Tyr Leu Ala Lys Arg Ala Glu Lys Leu 
420 425 430 



Gly lie Lys Leu Thr lie Gly Arg Asp Gly Ser Glu Pro Lys Met Gin 
435 440 445 

Arg lie Gly Asp Met Thr Ala Val Glu Val Lys Gly Arg lie His Phe 
450 455 460 

Asp Leu Tyr His Val lie Thr Arg Thr lie Asn Leu Pro Thr Tyr Thr 
465 470 475 480 

Leu Glu Ala Val Tyr Glu Ala lie Phe Gly Lys Pro Lys Glu Lys Val 

485 490 495 

Tyr Ala Asp Glu lie Ala Lys Ala Trp Glu Ser Gly Glu Asn Leu Glu 
500 505 510 

Arg Val Ala Lys Tyr Ser Met Glu Asp Ala Lys Ala Thr Tyr Glu Leu . 
515 520 525 

Gly Lys Glu Phe Leu Pro Met Glu lie Gin Leu Ser Glu Arg Leu Leu 
530 535 540 

Trp Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His 
545 550 555 560 

Met Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu 

565 570 575 

Ser Leu Glu Val Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe 
580 585 590 

Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu 
595 600 . 605 

Arg Val Leu Phe Asp Glu Leu Gly Leu Pro Ala lie- Gly Lys Thr Glu 
610 615 620 

Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg 
625 630 635 640 

Glu Ala His Pro. lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr 

645 650 655 

Lys Leu Lys Ser Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro 
660 665 ' 670 

Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr 
675 680 685 

Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg 
690 695 700 



Thr Pro Leu Gly Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly 
705 710 715 720 



Trp Leu Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu 

725 730 735 
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u :g Val Phe Gin Glu Gly 

740 745 750 

Arg Asp lie His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg 
755 760 765 

Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe 
770 775 780 

Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala 
785 790 795 800 

lie Pro Tyr Glu Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser 
805 810 815 

Phe Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg 
820 825 830 

Arg Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro 
835 840 845 

Asp Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met 
850 855 860 

Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu 
865 870 875 880 

Ala Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met 

885 890 895 

Leu Leu Gin Val His Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg 
900 905 910 

Ala Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr 
915 920 . 925 

Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp 
930 935 940 

Leu Ser Ala Lys Glu 
945 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single strand 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Arg Gly Ser His His His His His His Ala Ala Asp Asp Asp Asp 
15 10 15 



Lys Met A^l^ly Met Leu Pro Leu Phe Gl 

^0 25 



* 
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Lys Gly Arg Val Leu 

30 



Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
35 40 45 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
50 55 60 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
65 70 75 80 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
85 90 95 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
100 105 110 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
115 120 125 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
130 135 140 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
145 150 155 160 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
165 170 175 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
180 185 190 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
195 200 205 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
210 215 22CL 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
225 230 235 240 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
245 250 255 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu ' Pro Leu Glu 
260 265 270 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
275 280 285 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
290 295 300 

Leu Glu Ser Pro Val Arg Glu His Pro Ala Val Val Asp He Phe Glu 
305 310 315 320 



Tyr Asp He Pro Phe Ala Lys Arg Tyr Leu He Asp Lys Gly Leu He 
325 330 335 



;JBly Glu Glu Glu Leu Lys IldB^a 

a n n ACL 
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Pro Met GS^Bly Glu Glu Glu Leu Lys IldHKi Ala Phe Asp He Glu 

^40 345 350 

Thr Leu Tyr His Glu Gly Glu Glu Phe Gly Lys Gly Pro He He Met 
355 360 365 

He Ser Tyr Ala Asp Glu Asn Glu Ala Lys Val He Thr Trp Lys Asn 
370 375 380 

He Asp Leu Pro Tyr Val Glu Val Val Ser Ser Glu Arg Glu Met He 
385 390 395 400 

Lys Arg Phe Leu Arg lie He Arg Glu Lys Asp Pro Asp He He Val 
405 410 415 

Thr Tyr Asn Gly Asp Ser Phe Asp Phe Pro Tyr Leu Ala Lys Arg Ala 
420 425 430 

Glu Lys Leu Gly He Lys Leu Thr He Gly Arg Asp Gly Ser Glu Pro 
435 440 445 

Lys Met Gin Arg He Gly Asp Met Thr Ala Val Glu Val Lys Gly Arg 
450 455 460 

He His Phe Asp Leu Tyr His Val He Thr Arg Thr He Asn Leu Pro 
465 470 475 480 

Thr Tyr Thr Leu Glu Ala Val Tyr Glu Ala He Phe Gly Lys Pro Lys 
485 490 495 

Glu Lys Val Tyr Ala Asp Glu He Ala Lys Ala Trp Glu Ser Gly Glu 
500 505 510 

Asn Leu Glu Arg Val Ala Lys Tyr Ser Met Glu Asp Ala Lys Ala Thr 
515 520 525 

Tyr Glu Leu Gly Lys Glu Phe Leu Pro Met Glu lie Gin Leu Ser Arg 
530 535 540*. 

Leu Val Gly Gin Pro Leu Trp Asp Val Ser Arg Ser Ser Thr Gly Asn 
545 550 555 560 

Leu Val Glu Trp Phe Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Val 
.565 570 575 

Ala Pro Asn Lys Pro Ser Glu Glu Glu Tyr Gin Arg Arg *Leu Arg Glu 
580 585 590 

Ser Tyr Thr Gly Gly Phe Val Arg Leu Asp Val Ala Tyr Leu Arg Ala 
595 600 605 

Leu Ser Leu Glu Val Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val 
610 615 620 

Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu 
625 630 635 640 

Glu Arg Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr 

645 ' 650 655 



rl: l^ly Lys Arg Ser Thr Ser Ala^^a 
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Glu Lys Tl^ly Lys Arg Ser Thr Ser Ala^a Val Leu Glu Ala Leu 

660 665 670 

Arg Glu Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu 
675 680 685 

Thr Lys Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His 
690 695 700 

Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala 
705 710 715 720 

Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val 
725 730 735 

Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe He Ala Glu Glu 
740 745 750 

Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val 
755 760 765 

Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu 
770 775 780 

Gly Arg Asp He His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro 
785 7 90 795 800 

Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn 
805 810 815 

Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu 
820 825 830 

Ala He Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin 
835 840 845 

Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly 

850 855 860 

Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val 
865 870 875 880 

Pro Asp Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg 
885 890 895 

Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp 'Leu Met Lys 
900 905 910 

Leu Ala Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg 
915 920 925 

Met Leu Leu Gin Val His Asp Glu Leu Val Leu Glu Ala Pro Lys Glu 
930 935 940 

Arg Ala Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val 
:945 950 955 960 

Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp 
965 970 975 



Trp Leu Sel 



a Lys Glu 
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(Z^ INFORMATION FOR .SEQ ID NO: 13: 

^i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single strand 
sip) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: other nucleic acid 

(A)\DESCRIPTION: /desc = "oligonucleotide" 

(ix) CHARACTERISTIC: 

(A) NAM^/KEY: CDS 

(B) POSITION: 1. .66 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 
GAA TTC ATG AGG GGC TCG &£T CAC CAT CAC CAT CAC GCT GCT GAC GAT 
GAC GAT AAA ATG AGG GGC 



48 
66 



(2) INFORMATION FOR SEQ ID NO:\14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acNs 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID N(\ 14 

Met Arg Gly Ser His His His His His His Aj\ Ala Asp. Asp Asp Asp 
1 5 10 \ ~\ 15 

Lys Met Arg Gly 
20 



